<!DOCTYPE html SYSTEM "about:legacy-compat">
<html manifest="pamflet.manifest">
      <head>
        <title>MongoDB+Hadoop Connector — MongoDB+Hadoop Connector</title>
        <link type="text/css" media="screen, projection" rel="stylesheet" href="css/blueprint/screen.css"></link>
        <link type="text/css" media="screen and (min-device-width: 800px), projection" rel="stylesheet" href="css/blueprint/grid.css"></link>
        <link type="text/css" media="print" rel="stylesheet" href="css/blueprint/print.css"></link> 
        <!--[if lt IE 8]>
          <link rel="stylesheet" href="css/blueprint/ie.css" type="text/css" media="screen, projection"/>
        <![endif]-->
        <link type="text/css" media="screen, projection" rel="stylesheet" href="css/pamflet.css"></link>
        <link type="text/css" media="print" rel="stylesheet" href="css/pamflet-print.css"></link>
        <link type="text/css" media="screen and (min-device-width: 800px), projection" rel="stylesheet" href="css/pamflet-grid.css"></link>
        
        <script src="js/jquery-1.6.2.min.js"></script>
        <script src="js/jquery.collapse.js"></script>
        <script src="js/pamflet.js"></script>
        
        <meta charset="utf-8"></meta>
        <meta name="viewport" content="width=device-width, initial-scale=1"></meta>
      </head>
      <body>
        <a class="page next nav" href="Frequently+Asked+Questions.html">
            <span class="space">&nbsp;</span>
            <span>❧</span>
          </a>
        <div class="container">
          <div class="span-16 prepend-1 append-1">
            <div class="top nav span-16 title">
              <span>MongoDB+Hadoop Connector</span> 
            </div>
          </div>
          <div class="span-16 prepend-1 append-1 contents">
            <h1 id="MongoDB%2BHadoop+Connector">MongoDB+Hadoop Connector</h1><p><strong>CURRENT RELEASE</strong>: 1.0.0-rc1
</p><p>The <em>Mongo+Hadoop Connector</em> (for brevitys sake, we’ll often refer to it as <em>mongo-hadoop</em> in this documentation) is a series of plugins for the <a title="Apache Hadoop" href="http://apache.hadoop.org">Apache Hadoop Platform</a> to allow connectivity to <a title="MongoDB" href="http://mongodb.org">MongoDB</a>. This connectivity takes the form of allowing both reading MongoDB data into Hadoop (for use in MapReduce jobs as well as other components of the Hadoop ecosystem), as well as writing the results of Hadoop jobs out to MongoDB. A forthcoming release will also allow for reading and writing static BSON files (ala <em>mongodump / mongorestore</em>) to allow offline batching; commonly, users find this to be a beneficial feature to run analytics against backup data.
</p><p>At this time, we support the “core” Hadoop APIs (now known as <a title="Hadoop Common" href="http://hadoop.apache.org/common/">Hadoop Common</a>), in the form of <em>mongo-hadoop-core</em>. There is additionally support for other pieces of the Hadoop Ecosystem, including <a title="Apache Pig" href="http://pig.apache.org">Pig</a> for ETL and <a title="Hadoop Streaming" href="http://hadoop.apache.org/common/docs/current/streaming.html">Streaming</a> for running Mongo+Hadoop jobs with Python (future releases will support additional scripting languages such as Ruby). Although it is not dependent upon Hadoop, we also provide a connector for the <a title="Flume" href="https://github.com/cloudera/flume/wiki">Flume</a> distributed logging system.
</p><h2 id="Support">Support</h2><p><em>mongo-hadoop</em> is dependent upon the MongoDB Java Driver — currently version 2.7.3.
</p><p>Bugs &amp; Features should be tracked and requested on the <a title="MongoDB Jira" href="https://jira.mongodb.org/browse/HADOOP/">MongoDB Jira</a>. If you have questions please email the
<a title="MongoDB User Mailing List" href="http://groups.google.com/group/mongodb-user">mongodb-user Mailing List</a>,
rather than directly contacting contributors or maintainers.
</p><h3 id="Maintainers">Maintainers</h3><ul><li>Brendan McAdams <brendan@10gen.com>
</li><li>Jeff Yemin <jeff.yemin@10gen.com>
</li></ul><h4 id="Contributors">Contributors</h4><ul><li>Eliot Horowitz  <erh@10gen.com>
</li><li>Ryan Nitz 
</li><li>Joseph Shraibman <jks@iname.com> (Sharded Input Splits)
</li><li>Sumin Xia <xiasumin1984@gmail.com> (Sharded Input Splits)
</li><li>Priya Manda <priyakanth024@gmail.com> (Test Harness Code)
</li><li>Rushin Shah <rushin10@gmail.com> (Test Harness Code)
</li><li>Sarthak Dudhara <sarthak.83@gmail.com> (BSONWritable comparable interface)
</li></ul><div class="tocwrapper show">
      <a class="tochead nav" style="display: none" href="#toc">❦</a>
      <a name="toc"></a>
      <h4 class="toctitle">Contents</h4>
      <div class="tocbody">
      <div class="current">MongoDB+Hadoop Connector</div><ol class="toc"> <li><div><a href="Frequently+Asked+Questions.html">Frequently Asked Questions</a></div></li><li><div><a href="Getting+Started.html">Getting Started</a></div><ol class="toc"> <li><div><a href="Building+the+Adapter.html">Building the Adapter</a></div></li><li><div><a href="Configuration+%26+Behavior.html">Configuration &amp; Behavior</a></div></li> </ol></li><li><div><a href="Hadoop+Streaming+Support.html">Hadoop Streaming Support</a></div><ol class="toc"> <li><div><a href="Building+Hadoop+Streaming+Support.html">Building Hadoop Streaming Support</a></div></li> </ol></li><li class="generated"><div><a href="Contents+in+Depth.html">Contents in Depth</a></div></li><li class="generated"><div><a href="Combined+Pages.html">Combined Pages</a></div></li> </ol></div></div>
          </div>
        </div>
        <a class="fork nav" href="http://github.com/mongodb/mongo-hadoop"><img alt="Fork me on GitHub" src="img/fork.png"></img></a>
      </body>
    </html>